618 results found.
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
English German Kamas Russian
Availability:
Freely Available
License:
CC BY NC SA 4.0
Size:
48293 words Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
-
Paper title:Towards Flexible Cross-Resource Exploitation of Heterogeneous Language Documentation Data
-
Paper track:Infrastructural Issues/Large Projects/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Timm Lehmberg | INEL Kamas Corpus 0.1 | /N |
Documentation:
http://hdl.handle.net/11022/0000-0007-CF44-4
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Dolgan English German Russian
Availability:
Freely Available
License:
CC BY SA NC 4.0
Size:
72912 words Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
-
Paper title:Towards Flexible Cross-Resource Exploitation of Heterogeneous Language Documentation Data
-
Paper track:Infrastructural Issues/Large Projects/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Timm Lehmberg | INEL Dolgan Corpus 1.0 | /N |
Documentation:
http://hdl.handle.net/11022/0000-0007-D822-F
Written
Corpus,
Language Type:
Multilingual
Languages:
English German Spanish italian
Availability:
Freely Available
License:
<Not Specified>
Size:
799 sentences Production Status:
Newly created-in progress
Use:
Named Entity Recognition
-
Paper title:Building Named Entity Recognition Taggers via Parallel Corpora
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES |
| Author 2 | Yiling Chung | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES |
| Author 3 | Itziar Aldabe | University of the Basque Country (UPV/EHU) | ES |
| Author 4 | Nora Aranberri | University of the Basque Country | ES |
| Author 5 | Gorka Labaka | University of the Basque Country (UPV/EHU) | ES |
| Author 6 | German Rigau | UPV/EHU | ES |
| Main Contact | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | None |
Documentation:
https://github.com/ixa-ehu/ner-evaluation-corpus-europarl/
Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic Czech English German french
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Improving Neural Machine Translation by Incorporating Hierarchical Subword Features
-
Paper track:NLP engineering experiment paper
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Makoto Morishita | NTT Communication Science Laboratories | JP |
| Author 2 | Jun Suzuki | NTT CS Lab. | JP |
| Author 3 | Masaaki Nagata | +81-774-93-5235 | JP |
| Main Contact | Makoto Morishita | NTT Communication Science Laboratories | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English Estonian German Latvian
Availability:
Freely Available
License:
Open Source
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Bilingual Dictionaries
-
Paper title:Bilingual dictionaries for all EU languages
-
Paper track:Multimodality
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ahmet Aker | University of Sheffield | GB |
| Author 2 | Monica Paramita | University of Sheffield | GB |
| Author 3 | Marcis Pinnis | Tilde | LV |
| Author 4 | Robert Gaizauskas | University of Sheffield | GB |
| Main Contact | Ahmet Aker | University of Sheffield | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English German Spanish french
Availability:
From Owner
License:
<Not Specified>
Size:
334.4 MByte Production Status:
Existing-used
Use:
Corpus Creation/Annotation
-
Paper title:A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Jérémy Ferrero | Université Grenoble Alpes | FR |
| Author 2 | Frédéric Agnès | Compilatio | FR |
| Author 3 | Laurent Besacier | LIG | FR |
| Author 4 | Didier Schwab | Univ. Grenoble Alpes | FR |
| Main Contact | Jérémy Ferrero | Université Grenoble Alpes | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
English German Japanese Russian french
Availability:
Freely Available
License:
CreativeCommons-by-sa
Size:
>1,700,000 entries Production Status:
Existing-updated
Use:
Semantic Web
-
Paper title:Dbnary: Wiktionary as Linked Data for 12 Language Editions with Enhanced Translation Relations
-
Paper track:dataset description
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Gilles Sérasset | Université Joseph Fourier - Grenoble 1 | FR |
| Author 2 | Andon Tchechmedjiev | Université Joseph Fourier - Grenoble 1 | None |
| Main Contact | Gilles Sérasset | Université Joseph Fourier - Grenoble 1 | None |
Documentation:
Online at resource URL
Written
Corpus,
Language Type:
Multilingual
Languages:
English Finnish German Spanish french
Availability:
Freely Available
License:
Creative Commons
Size:
40 GByte Production Status:
Newly created-finished
Use:
Language Modelling
-
Paper title:European Union Language Resources in Sketch Engine
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Vít Baisa | Masaryk University | CZ |
| Author 2 | Jan Michelfeit | Lexical Computing Ltd | GB |
| Author 3 | Marek Medveď | Masaryk University | CZ |
| Author 4 | Milos Jakubicek | Lexical Computing | CZ |
| Main Contact | Vít Baisa | Masaryk University | None |
Documentation:
https://www.sketchengine.co.uk/eur-lex/Language Type:
Multilingual
Languages:
Georgian German Russian Ukrainian
Availability:
From Data Center(s)
License:
Creative Commons Attribution 3.0 Unported (CC BY 3.0) http://creativecommons.org/licenses/by/3.0/
Size:
<Not Specified> Production Status:
<Not Specified>
Use:
Language Modelling
-
Paper title:The Multilingual GRUG Parallel Treebank – a New Initiative for Syntactic Annotation of Under-Resourced Languages
-
Paper track:<Not Specified>
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Oleg Kapanadze | <Not Specified> | GE |
| Main Contact | Oleg Kapanadze | Tbilisi State University | None |
Documentation:
The documentation in English is publicly available at http://fedora.clarin-d.uni-saarland.de/grug/documentation.html
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English German french italian
Availability:
Freely Available
License:
<Not Specified>
Size:
884603 <Not Specified>Production Status:
Existing-used
Use:
Word Sense Disambiguation
-
Paper title:A new semantically annotated corpus with syntactic-semantic and cross-lingual senses
-
Paper track:General issues
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | RAKHO Myriam | <Not Specified> | None |
| Author 2 | LAPORTE Éric | <Not Specified> | None |
| Author 3 | CONSTANT Matthieu | <Not Specified> | None |
| Main Contact | RAKHO Myriam | Université Paris-Est | FR |
Documentation:
http://homepages.inf.ed.ac.uk/pkoehn/publications/europarl.pdf




